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SEQUENCE LISTING 

5 (1) GENERAL INFORMATION 

(i) APPLICANT: MICHALOVICH, DAVID 
HAYES, PHILIP DAVID 

10 (ii) TITLE OF THE INVENTION: NOVEL COMPOUNDS 

(iii) NUMBER OF SEQUENCES: 4 

15 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Ratner & Prestia 

(B) STREET: P.O. Box 980 

(C) CITY: Valley Forge 

(D) STATE: PA 

20 (E) COUNTRY: USA 

(F) ZIP: 19482 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette 
25 (B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 



CURRENT APPLICATION DATA: 
) APPLICATION NUMBER: TO BE ASSIGNED 

(B) FILING DATE: 29-JAN-1999 

(C) CLASSIFICATION: UNKNOWN 



(vii) PRIOR APPLICATION DATA: 
35 (A) APPLICATION NUMBER: UK APPLICATION NO. TO BE ASSIGNED 

(B) FILING DATE: 20-JAN-1999 



40 



(A) APPLICATION NUMBER: EP APPLICATION NO. 98300694.1 

(B) FILING DATE: 30-JAN-1998 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Prestia, Paul F 

(B) REGISTRATION NUMBER: 23,031 

45 (C) REFERENCE/ DOCKET NUMBER: GP-30039 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-407-0700 

(B) TELEFAX: 610-407-0700 
50 (C) TELEX: 846169 



(2) INFORMATION FOR SEQ ID NO : 1 : 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
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ATGACGCCTC CTCCGCCCGG ACGTGCCGCC CCCAGCGCAC CGCGCGCCCG CGTCCCTGGC 60 

CCGCCGGCTC GGTTGGGGCT TCCGCTGCGG CTGCGGCTGC TGCTGCTGCT CTGGGCGGCC 12 0 

GCCGCCTCCG CCCAGGGCCA CCTAAGGAGC GGACCCCGCA TCTTCGCCGT CTGGAAAGGC 18 0 

CATGTAGGGC AGGACCGGGT GGACTTTGGC CAGACTGAGC CGCACACGGT GCTTTTCCAC 24 0 

GAGCCAGGCA GCTCCTCTGT GTGGGTGGGA GGACGTGGCA AGGTCTACCT CTTTGACTTC 300 

CCCGAGGGCA AG AAC G CAT C TGTGCGCACG GTGAATATCG GCTCCACAAA GGGGTCCTGT 3 60 

CTGGATAAGC GGGACTGCGA GAACTACATC ACTCTCCTGG AGAGGCGGAG TGAGGGGCTG 4 20 

CTGGCCTGTG GCACCAACGC CCGGCACCCC AGCTGCTGGA ACCTGGTGAA TGGCACTGTG 4 80 

GTGCCACTTG GC GAG AT GAG AGGCTACGCC CCCTTCAGCC CGGACGAGAA CTCCCTGGTT 54 0 

CTGTTTGAAG GGGACGAGGT GTATTCCACC ATCCGGAAGC AGGAATACAA TGGGAAGATC 600 

CCTCGGTTCC GCCGCATCCG GGGCGAGAGT GAGCTGTACA CCAGTGATAC TGTCATGCAG 660 

AACCCACAGT TCATCAAAGC CACCATCGTG CACCAAGACC AGGCTTACGA TGACAAGATC 72 0 

TACTACTTCT TCCGAGAGGA CAATCCTGAC AAGAATCCTG AGGCTCCTCT CAATGTGTCC 7 80 

CGTGTGGCCC AGTTGTGCAG GGGGGACCAG GGTGGGGAAA GTTCACTGTC AGTCTCCAAG 84 0 

TGGAACACTT TTCTGAAAGC CATGCTGGTA TGCAGTGATG CTGCCACCAA CAAGAACTTC 900 

AACAGGCTGC AAGACGTCTT CCTGCTCCCT GACCCCAGCG GCCAGTGGAG GGACACCAGG 960 

GTCTATGGTG TTTTCTCCAA CCCCTGGAAC TACTCAGCCG TCTGTGTGTA TTCCCTCGGT 1020 

G AC AT T G AC A AGGTCTTCCG TACCTCCTCA CTCAAGGGCT ACCACTCAAG CCTTCCCAAC 1080 

CCGCGGCCTG GCAAGTGCCT CCCAGACCAG CAGCCGATAC C C AC AG AG AC CTTCCAGGTG 114 0 

GCTGACCGTC ACCCAGAGGT GGCGCAGAGG GTGGAGCCCA TGGGGCCTCT GAAGACGCCA 1200 

TTGTTCCACT CTAAATACCA CTACCAGAAA GTGGCCGTCC ACCGCATGCA AGCCAGCCAC 12 60 

GGGGAGACCT TTCATGTGCT TTACCTAACT AC AG AC AG G G GCACTATCCA CAAGGTGGTG 1320 

GAACCGGGGG AGCAGGAGCA CAGCTTCGCC T T C AAC AT C A TGGAGATCCA GCCCTTCCGC 1380 

CGCGCGGCTG CCATCCAGAC CATGTCGCTG GATGCTGAGC GGAGGAAGCT GTATGTGAGC 14 4 0 

TCCCAGTGGG AGGTGAGCCA GGTGCCCCTG GACCTGTGTG AGGTCTATGG CGGGGGCTGC 1500 

CACGGTTGCC TCATGTCCCG AGACCCCTAC TGCGGCTGGG ACCAAGGCCG CTGCATCTCC 15 60 

ATCTACAGCT CCGAACGGTC AGTGCTGCAA TCCATTAATC CAGCCGAGCC ACACAAGGAG 1620 

TGTCCCAACC CCAAACCAGA CAAGGCCCCA CTGCAGAAGG TTTCCCTGGC CCCAAACTCT 168 0 

CGCTACTACC TGAGCTGCCC CATGGAATCC CGCCACGCCA CCTACTCATG GCGCCACAAG 174 0 

GAGAACGTGG AGCAGAGCTG CGAACCTGGT CACCAGAGCC CCAACTGCAT CCTGTTCATC 1800 

GAGAACCTCA CGGCGCAGCA GTACGGCCAC TACTTCTGCG AGGCCCAGGA GGGCTCCTAC 18 60 

TTCCGCGAGG CTCAGCACTG GCAGCTGCTG CCCGAGGACG GCATCATGGC CGAGCACCTG 1920 

CTGGGTCATG CCTGTGCCCT GGCCGCCTCC CTCTGGCTGG GGGTGCTGCC CACACTCACT 198 0 

CTTGGCTTGC TGGTCCACTA GGGCCTCCCG 2010 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Pro Pro Pro Pro Gly Arg Ala Ala Pro Ser Ala Pro Arg Ala 
15 10 15 

Arg Val Pro Gly Pro Pro Ala Arg Leu Gly Leu Pro Leu Arg Leu Arg 

20 25 30 

Leu Leu Leu Leu Leu Trp Ala Ala Ala Ala Ser Ala Gin Gly His Leu 

35 40 45 

Arg Ser Gly Pro Arg lie Phe Ala Val Trp Lys Gly His Val Gly Gin 

50 55 60 

Asp Arg Val Asp Phe Gly Gin Thr Glu Pro His Thr Val Leu Phe His 

65 70 75 80 

Glu Pro Gly Ser Ser Ser Val Trp Val Gly Gly Arg Gly Lys Val Tyr 

85 90 95 

Leu Phe Asp Phe Pro Glu Gly Lys Asn Ala Ser Val Arg Thr Val Asn 
100 105 110 
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He 


Gly 


Ser 


Thr 


Lys 


Gly 


Ser 


Cys 


Leu 


Asp 


Lys 


Arg 


Asp 


Cys 


Glu 


Asn 






115 










120 










125 








Tyr 


He 


Thr 


Leu 


Leu 


Glu 


Arg 


Arg 


Ser 


Glu 


Gly 


Leu 


Leu 


Ala 


Cys 


Gly 




130 










135 










140 










Thr 


Asn 


Ala 


Arg 


His 


Pro 


Ser 


Cys 


Trp 


Asn 


Leu 


Val 


Asn 


Gly 


Thr 


Val 


145 










150 










155 










160 


Val 


Pro 


Leu 


Gly 


Glu 


Met 


Arg 


Gly 


Tyr 


Ala 


Pro 


Phe 


Ser 


Pro 


Asp 


Glu 










165 










170 










175 




Asn 


Ser 


Leu 


Val 


Leu 


Phe 


Glu 


Gly 


Asp 


Glu 


Val 


Tyr 


Ser 


Thr 


lie 


Arg 








180 










185 










190 






Lys 


Gin 


Glu 


Tyr 


Asn 


Gly 


Lys 


lie 


Pro 


Arg 


Phe 


Arg 


Arg 


lie 


Arg 


Gly 






195 










200 










205 








Glu 


Ser 


Glu 


Leu 


Tyr 


Thr 


Ser 


Asp 


Thr 


Val 


Met 


Gin 


Asn 


Pro 


Gin 


Phe 




210 










215 










220 










He 


Lys 


Ala 


Thr 


He 


Val 


His 


Gin 


Asp 


Gin 


Ala 


Tyr 


Asp 


Asp 


Lys 


lie 


225 










230 










235 










240 


Tyr 


Tyr 


Phe 


Phe 


Arg 


Glu 


Asp 


Asn 


Pro 


Asp 


Lys 


Asn 


Pro 


Glu 


Ala 


Pro 










245 










250 










255 




Leu 


Asn 


Val 


Ser 


Arg 


Val 


Ala 


Gin 


Leu 


Cys 


Arg 


Gly 


Asp 


Gin 


Gly 


Gly 








260 










265 










270 






Glu 


Ser 


Ser 


Leu 


Ser 


Val 


Ser 


Lys 


Trp 


Asn 


Thr 


Phe 


Leu 


Lys 


Ala 


Met 






275 










280 










285 








Leu 


Val 


Cys 


Ser 


Asp 


Ala 


Ala 


Thr 


Asn 


Lys 


Asn 


Phe 


Asn 


Arg 


Leu 


Gin 




290 










295 










300 










Asp 


Val 


Phe 


Leu 


Leu 


Pro 


Asp 


Pro 


Ser 


Gly 


Gin 


Trp 


Arg 


Asp 


Thr 


Arg 


305 










310 










315 










320 


Val 


Tyr 


Gly 


Val 


Phe 


Ser 


Asn 


Pro 


Trp 


Asn 


Tyr 


Ser 


Ala 


Val 


Cys 


Val 










325 










330 










335 




Tyr 


Ser 


Leu 


Gly 


Asp 


He 


Asp 


Lys 


Val 


Phe 


Arg 


Thr 


Ser 


Ser 


Leu 


Lys 








340 










345 










350 






Gly 


Tyr 


His 


Ser 


Ser 


Leu 


Pro 


Asn 


Pro 


Arg 


Pro 


Gly 


Lys 


Cys 


Leu 


Pro 






355 










360 










365 








Asp 


Gin 


Gin 


Pro 


He 


Pro 


Thr 


Glu 


Thr 


Phe 


Gin 


Val 


Ala 


Asp 


Arg 


His 




370 










375 










380 










Pro 


Glu 


Val 


Ala 


Gin 


Arg 


Val 


Glu 


Pro 


Met 


Gly 


Pro 


Leu 


Lys 


Thr 


Pro 


385 










390 










395 










400 


Leu 


Phe 


His 


Ser 


Lys 


Tyr 


His 


Tyr 


Gin 


Lys 


Val 


Ala 


Val 


His 


Arg 


Met 










405 










410 










415 




Gin 


Ala 


Ser 


His 


Gly 


Glu 


Thr 


Phe 


His 


Val 


Leu 


Tyr 


Leu 


Thr 


Thr 


Asp 








420 










425 










430 






Arg 


Gly 


Thr 


He 


His 


Lys 


Val 


Val 


Glu 


Pro 


Gly 


Glu 


Gin 


Glu 


His 


Ser 






435 










440 










445 








Phe 


Ala 


Phe 


Asn 


He 


Met 


Glu 


He 


Gin 


Pro 


Phe 


Arg 


Arg 


Ala 


Ala 


Ala 




450 










455 










460 










He 


Gin 


Thr 


Met 


Ser 


Leu 


Asp 


Ala 


Glu 


Arg 


Arg 


Lys 


Leu 


Tyr 


Val 


Ser 


465 










470 










475 










480 


Ser 


Gin 


Trp 


Glu 


Val 


Ser 


Gin 


Val 


Pro 


Leu 


Asp 


Leu 


Cys 


Glu 


Val 


Tyr 










485 










490 










495 




Gly 


Gly 


Gly 


Cys 


His 


Gly 


Cys 


Leu 


Met 


Ser 


Arg 


Asp 


Pro 


Tyr 


Cys 


Gly 








500 










505 










510 






Trp 


Asp 


Gin 


Gly 


Arg 


Cys 


He 


Ser 


lie 


Tyr 


Ser 


Ser 


Glu 


Arg 


Ser 


Val 






515 










520 










525 








Leu 


Gin 


Ser 


lie 


Asn 


Pro 


Ala 


Glu 


Pro 


His 


Lys 


Glu 


Cys 


Pro 


Asn 


Pro 




530 










535 










540 










Lys 


Pro 


Asp 


Lys 


Ala 


Pro 


Leu 


Gin 


Lys 


Val 


Ser 


Leu 


Ala 


Pro 


Asn 


Ser 


545 










550 










555 










560 


Arg 


Tyr 


Tyr 


Leu 


Ser 


Cys 


Pro 


Met 


Glu 


Ser 


Arg 


His 


Ala 


Thr 


Tyr 


Ser 










565 










570 










575 




Trp 


Arg 


His 


Lys 


Glu 


Asn 


Val 


Glu 


Gin 


Ser 


Cys 


Glu 


Pro 


Gly 


His 


Gin 








580 










585 








590 






Ser 


Pro 


Asn 


Cys 


He 


Leu 


Phe 


lie 


Glu 


Asn 


Leu 


Thr 


Ala 


Gin 


Gin 


Tyr 






595 










600 










605 
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20 



35 



45 



Gly His Tyr Phe Cys Glu Ala Gin Glu Gly Ser Tyr Phe Arg Glu Ala 

610 615 620 

Gin His Trp Gin Leu Leu Pro Glu Asp Gly lie Met Ala Glu His Leu 
625 630 635 640 

.Leu Gly His Ala Cys Ala Leu Ala Ala Ser Leu Trp Leu Gly Val Leu 

645 650 655 

Pro Thr Leu Thr Leu Gly Leu Leu Val His 
660 665 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 712 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



CCGCCTGCCG CCCAGGGCCA CCTAAGGAGC GGATNCTANN TCTTCGCCGT CTGGAAAGGC 60 

CATGTAGGGC AGGACCGGGT GGACTTTGGC CAGACTGAGC CGCACACGGT GCTTTTCCAC 120 

25 GAGCCAGGCA GCTCCTCTGT GTGGGTGGGA GGACGTGGCA AGGTCTACCT CTTTGACTTC 18 0 

CCCGAGGGCA AGAACGCATC TGTGCGCACG GTGAATATCG GCTCCACAAA GGGGTCCTGT 24 0 

CTGGATAAGC GGGACTGCGA GAACTACATC ACTCTCCTGG AGAGGCGGAG TGAGGGGCTG 300 

CTGGCCTGTG GCACCAACGC CCGGCACCCC AGCTGCTGGA ACCTGGTGAA TGCACTGTGG 360 

TGCCACCTTG GCGAGAGTGG AGGCTACGCC CCCTTCAGCC CGGACGAGAA CGTCCCGTGG 4 20 

30 TTCTGTTTTG AAGGGGACGA AGTGTATTCC ACCATCCGGA AAGCAAGGAA TTACAATTGG 4 80 

GAAGATCCTC GGTTCCGCCG CATCCGGGGC GAG AG T GAG C TGTACACCAG TGATACTGTC 54 0 

ATGCAGAACC C AC AG T T CAT CAAAGCCACC ATCGTGCACC AAGACCAGGC T T AC GAT G AC 600 

AAGATCTACT ACTTCTTCCG AGAGGACAAT CCTGACAAGA ATCCTGAGGC TCCTCTCAAT 660 

GTGTCCCGTG TGGCCCAGTT GTGCAGGGGG GACCAGGGTG GGGAAAGTTC AN 712 



(2) INFORMATION FOR SEQ ID NO : 4 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 215 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 





Gly 


Gin 


Asp 


Arg 


Val 


Asp 


Phe 


Gly 


Gin 


Thr 


Glu 


Pro 


His 


Thr 


Val 


Leu 


50 


1 








5 










10 










15 




Phe 


His 


Glu 


Pro 

20 


Gly 


Ser 


Ser 


Ser 


Val 
25 


Trp 


Val 


Gly 


Gly 


Arg 

30 


Gly 


Lys 




Val 


Tyr 


Leu 
35 


Phe 


Asp 


Phe 


Pro 


Glu 
40 


Gly 


Lys 


Asn 


Ala 


Ser 
45 


Val 


Arg 


Thr 


55 


Val 


Asn 


He 


Gly 


Ser 


Thr 


Lys 


Gly 


Ser 


Cys 


Leu 


Asp 


Lys 


Arg 


Asp 


Cys 




50 










55 










60 












Glu 


Asn 


Tyr 


He 


Thr 


Leu 


Leu 


Glu 


Arg 


Arg 


Ser 


Glu 


Gly 


Leu 


Leu 


Ala 




65 










70 










75 










80 




Cys 


Gly 


Thr 


Asn 


Ala 


Arg 


His 


Pro 


Ser 


Cys 


Trp 


Asn 


Leu 


Val 


Asn 


Ala 


60 










85 










90 










95 




Leu 


Trp 


Cys 


His 
100 


Leu 


Gly 


Glu 


Ser 


Gly 
105 


Gly 


Tyr 


Ala 


Pro 


Phe 
110 


Ser 


Pro 




Asp 


Glu 


Asn 
115 


Val 


Pro 


Trp 


Phe 


Cys 
120 


Phe 


Glu 


Gly 


Asp 


Glu 
125 


Val 


Tyr 


Ser 
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Thr lie Arg Lys Ala Arg Asn Tyr Asn Trp Glu Asp Pro Arg Phe Arg 

130 135 140 

Arg lie Arg Gly Glu Ser Glu Leu Tyr Thr Ser Asp Thr Val Met Gin 
145 150 155 160 

Asn Pro Gin Phe lie Lys Ala Thr He Val His Gin Asp Gin Ala Tyr 

165 170 175 

Asp Asp Lys He Tyr Tyr Phe Phe Arg Glu Asp Asn Pro Asp Lys Asn 

180 185 190 

Pro Glu Ala Pro Leu Asn Val Ser Arg Val Ala Gin Leu Cys Arg Gly 

195 200 205 

Asp Gin Gly Gly Glu Ser Ser 
210 215 




